Optimal Model Placement and Online Model Splitting for Device-Edge Co-Inference

نویسندگان

چکیده

Device-edge co-inference opens up new possibilities for resource-constrained wireless devices (WDs) to execute deep neural network (DNN)-based applications with heavy computation workloads. In particular, the WD executes first few layers of DNN and sends intermediate features edge server that processes remaining DNN. By adapting model splitting decision, there exists a tradeoff between local cost communication overhead. practice, is re-trained updated periodically at server. Once parameters are regenerated, part must be placed facilitate on-device inference. this paper, we study joint optimization placement online decisions minimize energy-and-time device-edge in presence channel fading. The problem challenging because strongly coupled, while involving two different time scales. We tackle by formulating an optimal stopping problem, where finite horizon determined decision. addition deriving rule based on backward induction, further investigate simple one-stage look-ahead rule, which able obtain analytical expressions analysis useful us efficiently optimize decision larger scale. closed-form solution fully-connected multilayer perceptron equal neurons. Simulation results validate superior performance various structures.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Inference for the Optimal Approximating Model

Abstract: In the setting of high-dimensional linear models with Gaussian noise, we investigate the possibility of confidence statements connected to model selection. Although there exist numerous procedures for adaptive (point) estimation, the construction of adaptive confidence regions is severely limited (cf. Li, 1989). The present paper sheds new light on this gap. We develop exact and adapt...

متن کامل

Optimal Inference After Model Selection

To perform inference after model selection, we propose controlling the selective type I error; i.e., the error rate of a test given that it was performed. By doing so, we recover long-run frequency properties among selected hypotheses analogous to those that apply in the classical (non-adaptive) context. Our proposal is closely related to data splitting and has a similar intuitive justification...

متن کامل

Model averaging, optimal inference, and habit formation

Postulating that the brain performs approximate Bayesian inference generates principled and empirically testable models of neuronal function-the subject of much current interest in neuroscience and related disciplines. Current formulations address inference and learning under some assumed and particular model. In reality, organisms are often faced with an additional challenge-that of determinin...

متن کامل

Bayesian Inference and Optimal Design for the Sparse Linear Model

The linear model with sparsity-favouring prior on the coefficients has important applications in many different domains. In machine learning, most methods to date search for maximum a posteriori sparse solutions and neglect to represent posterior uncertainties. In this paper, we address problems of Bayesian optimal design (or experiment planning), for which accurate estimates of uncertainty are...

متن کامل

Efficient Optimal Sensor Placement for Structural Model Based Diagnosis

This work aims to study which sensors are required to be installed in a process in order to improve certain fault diagnosis specifications. Especially, the present method is based on structural models. Thus, system models involving a wide variety of equations (e.g. linear, non-linear algebraic, dynamics) can be easy handled. The use of structural models permits to define the diagnosis propertie...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Wireless Communications

سال: 2022

ISSN: ['1536-1276', '1558-2248']

DOI: https://doi.org/10.1109/twc.2022.3165824